United States Patent [19]
Lockhart et al.

US005556752A
[11] Patent Number: 5,556,752
[45] Date of Patent: Sep. 17, 1996



[54] SURFACE-BOUND, UNIMOLECULAR,
DOUBLE-STRANDED DNA

[75] Inventors: David J. Lockhart, Santa Clara, Calif.;
Dirk Vetter, Freiburg, Germany;
Martin Diggelmann, Niederdorf,
Switzerland

[73] Assignee: Affymetrix, Inc., Santa Clara, Calif.

[21] Appl. No.: 327,687
[22] Filed: Oct. 24, 1994

[51] Int. Cl. C12Q 1/68; C07H 21

[58] Field of Search 435/6; 536/23.1;
530/413

[56] References Cited

U.S. PATENT DOCUMENTS

4,376,110 3/1983 David et al.
4,562,157 12/1985 Lowe et al. 435/287
4,728,502 3/1988 Hamill
5,143,854 9/1992 Pirrung et al.
5,288,514 2/1994 Ellman

FOREIGN PATENT DOCUMENTS

WO89/10977 11/1989 WIPO
WO89/11548 11/1989 WIPO
WO90/00626 1/1990 WIPO
WO90/15070 12/1990 WIPO
WO92/00091 1/1992 WIPO

OTHER PUBLICATIONS

(1988) Analytical Biochemistry 169: "... of a Sequence-specific
DNA binding protein using Teflon linked ..."
Ma, M. Y.-X. et al. (1993) Biochemistry 32: 1751-1758. "Design &
Synthesis of RNA Miniduplexes via a synthetic linker approach".
Markiewicz, W. T. et al. (1989) Nucleic Acids Research 17:
7149-7157. "Universal solid supports for the synthesis of
oligonucleotides with 3'-PO4s".
Ohlmeyer et al. (1993) Proc. Natl. Acad. Sci. USA 90:
10922-10926. "Complex Synthetic Chemical Libraries ..."
Geysen et al., J. Immun. Meth., 102:259-274 (1987).
Frank and Doring, Tetrahedron, 44:6031-6040 (1988).
Fodor et al., Science, 251:767-777 (1991).
Lam et al., Nature, 354:82-84 (1991).
Houghten et al., Nature, 354:84-86 (1991).
Galas et al., Nucleic Acids Res., 5(9):3157-3170 (1978).
Murphy et al., Science, 262:1025-1029 (1993).
Lysov et al., Dokl. Akad. Nauk SSSR, 303:1508-1511 (1988)
(See footnote provided, p. 436).
Bains et al., J. Theor. Biol., 135:303-307 (1988).
Drmanac et al., Genomics, 4:114-128 (1989).
Strezoska et al., Proc. Natl. Acad. Sci. USA,
88:10089-10093 (1991).
Drmanac et al., Science, 260:1649-1652 (1993).
Needels et al., Proc. Natl. Acad. Sci. USA,
90:10700-10704 (1993).
(1993) ... J. of Biol. Chem. ...: 5417-5423.

(List continued on next page.)

Primary Examiner—Mindy Fleisher
Assistant Examiner—Scott David Priebe
Attorney, Agent, or Firm—Townsend and Townsend and Crew LLP

[57] ABSTRACT

Libraries of unimolecular, double-stranded oligonucleotides
on a solid support. These libraries are useful in pharmaceutical
discovery for the screening of numerous biological samples for
specific interactions between the double-stranded
oligonucleotides and peptides, proteins, drugs and RNA. In a
related aspect, the present invention provides libraries of
conformationally restricted probes on a solid support. The
probes are restricted in their movement and flexibility using
double-stranded oligonucleotides as scaffolding. The probes are
also useful in various screening procedures associated with drug
discovery and diagnosis. The present invention further provides
methods for the preparation and screening of the above libraries.

6 Claims, 1 Drawing Sheet

SURFACE-BOUND, UNIMOLECULAR,
DOUBLE-STRANDED DNA

GOVERNMENT RIGHTS 

Research leading to the invention was funded in part by
NIH Grant No. R01HG00813-03 and the government may
have certain rights to the invention.

BACKGROUND OF THE INVENTION

The present invention relates to the field of polymer
synthesis and the use of polymer libraries for biological
screening. More specifically, in one embodiment the invention
provides arrays of diverse double-stranded oligonucleotide
sequences. In another embodiment, the invention provides
arrays of conformationally restricted probes, wherein
the probes are held in position using double-stranded DNA
sequences as scaffolding. Libraries of diverse unimolecular
double-stranded nucleic acid sequences and probes may be
used, for example, in screening studies for determination of
binding affinity exhibited by binding proteins, drugs, or
RNA.

Methods of synthesizing desired single stranded DNA
sequences are well known to those of skill in the art. In
particular, methods of synthesizing oligonucleotides are
found in, for example, Oligonucleotide Synthesis: A Practical
Approach, Gait, ed., IRL Press, Oxford (1984), incorporated
herein by reference in its entirety for all purposes.
Synthesizing unimolecular double-stranded DNA in solution
has also been described. See, Durand, et al. Nucleic Acids
Res. 18:6353-6359 (1990) and Thomson, et al. Nucleic
Acids Res. 21:5600-5603 (1993), the disclosures of both
being incorporated herein by reference.

Solid phase synthesis of biological polymers has been
evolving since the early "Merrifield" solid phase peptide
synthesis, described in Merrifield, J. Am. Chem. Soc.
85:2149-2154 (1963), incorporated herein by reference for
all purposes. Solid-phase synthesis techniques have been
provided for the synthesis of several peptide sequences on,
for example, a number of "pins." See e.g., Geysen et al., J.
Immun. Meth. 102:259-274 (1987), incorporated herein by
reference for all purposes. Other solid-phase techniques
involve, for example, synthesis of various peptide sequences
on different cellulose disks supported in a column. See Frank
and Doring, Tetrahedron 44:6031-6040 (1988), incorporated
herein by reference for all purposes. Still other solid-phase
techniques are described in U.S. Pat. No. 4,728,502
issued to Hamill and WO 90/00626 (Beattie, inventor).

Each of the above techniques produces only a relatively
low density array of polymers. For example, the technique
described in Geysen et al. is limited to producing 96
different polymers on pins spaced in the dimensions of a
standard microtiter plate.

Improved methods of forming large arrays of oligonucleotides,
peptides and other polymer sequences in a short
period of time have been devised. Of particular note, Pirrung
et al., U.S. Pat. No. 5,143,854 (see also PCT Application No.
WO 90/15070) and Fodor et al., PCT Publication No. WO
92/10092, all incorporated herein by reference, disclose
methods of forming vast arrays of peptides, oligonucleotides
and other polymer sequences using, for example, light-directed
synthesis techniques. See also, Fodor et al., Science,
251:767-777 (1991), also incorporated herein by reference
for all purposes. These procedures are now referred to as
VLSIPS™ procedures.

In the above-referenced Fodor et al., PCT application, an
elegant method is described for using a computer-controlled
system to direct a VLSIPS™ procedure. Using this
approach, one heterogeneous array of polymers is converted,
through simultaneous coupling at a number of reaction sites,
into a different heterogeneous array. See, U.S. Pat. No.
5,384,261 and U.S. application Ser. No. 07/980,523, the
disclosures of which are incorporated herein for all purposes.

The development of VLSIPS™ technology as described
in the above-noted U.S. Pat. No. 5,143,854 and PCT patent
publication Nos. WO 90/15070 and 92/10092, is considered
pioneering technology in the fields of combinatorial synthesis
and screening of combinatorial libraries. More recently,
patent application Ser. No. 08/082,937, filed Jun. 25, 1993,
now abandoned, describes methods for making arrays of
oligonucleotide probes that can be used to check or determine
a partial or complete sequence of a target nucleic acid
and to detect the presence of a nucleic acid containing a
specific oligonucleotide sequence.

A number of biochemical processes of pharmaceutical
interest involve the interaction of some species, e.g., a drug,
a peptide or protein, or RNA, with double-stranded DNA.
For example, protein/DNA binding interactions are involved
with a number of transcription factors as well as tumor
suppression associated with the p53 protein and the genes
contributing to a number of cancer conditions.

SUMMARY OF THE INVENTION 

High-density arrays of diverse unimolecular, double-stranded
oligonucleotides, as well as arrays of conformationally
restricted probes and methods for their use are
provided by virtue of the present invention. In addition,
methods and devices for detecting duplex formation of
oligonucleotides on an array of diverse single-stranded
oligonucleotides are also provided by this invention. Further,
an adhesive based on the specific binding characteristics
of two arrays of complementary oligonucleotides is
provided in the present invention.

According to one aspect of the present invention, libraries
of unimolecular, double-stranded oligonucleotides are provided.
Each member of the library is comprised of a solid
support, an optional spacer for attaching the double-stranded
oligonucleotide to the support and for providing sufficient
space between the double-stranded oligonucleotide and the
solid support for subsequent binding studies and assays, an
oligonucleotide attached to the spacer and further attached to
a second complementary oligonucleotide by means of a
flexible linker, such that the two oligonucleotide portions
exist in a double-stranded configuration. More particularly,
the members of the libraries of the present invention can be
represented by the formula:

Y-L¹-X¹-L²-X²

in which Y is a solid support, L¹ is a bond or a spacer, L² is
a flexible linking group, and X¹ and X² are a pair of
complementary oligonucleotides.
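
The formula above can be read as a small record with five fields. The following is only an illustrative sketch, not part of the disclosed method: the class name `LibraryMember`, its field names, and the placeholder strings used for the support, spacer, and linker are all hypothetical. It also assumes that X¹ and X² are written in the same chain direction, so that a fully paired X² is the reverse complement of X¹, and it handles DNA bases only.

```python
from dataclasses import dataclass

# Watson-Crick partners for DNA (A pairs with T, C pairs with G).
_PAIR = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq: str) -> str:
    """Return the sequence that pairs antiparallel with `seq` (DNA only)."""
    return "".join(_PAIR[base] for base in reversed(seq.upper()))


@dataclass(frozen=True)
class LibraryMember:
    """Hypothetical record mirroring Y-L1-X1-L2-X2."""
    support: str  # Y: solid support
    spacer: str   # L1: spacer; an empty string stands for a direct bond
    x1: str       # X1: first oligonucleotide
    linker: str   # L2: flexible linking group
    x2: str       # X2: second oligonucleotide

    def is_duplex(self) -> bool:
        """True when X2 is the exact reverse complement of X1."""
        return self.x2.upper() == reverse_complement(self.x1)

    def formula(self) -> str:
        """Render the member in Y-L1-X1-L2-X2 order, omitting an empty spacer."""
        parts = [self.support, self.spacer, self.x1, self.linker, self.x2]
        return "-".join(p for p in parts if p)


if __name__ == "__main__":
    x1 = "ACGTTG"
    member = LibraryMember("glass", "PEG", x1, "HEG", reverse_complement(x1))
    print(member.formula(), member.is_duplex())
```

The exact-match test in `is_duplex` is deliberately strict; forms with bulges or loops, which the specification also treats as double-stranded, would need a looser comparison.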

In a specific aspect of the invention, the library of
different unimolecular, double-stranded oligonucleotides
can be used for screening a sample for a species which binds
to one or more members of the library.

In a related aspect of the invention, a library of different
conformationally-restricted probes attached to a solid support
is provided. The individual members each have the
formula:

-X¹¹-Z-X¹²

in which X¹¹ and X¹² are complementary oligonucleotides
and Z is a probe having sufficient length such that X¹¹ and
X¹² form a double-stranded oligonucleotide portion of the
member and thereby restrict the conformations available to
the probe. In a specific aspect of the invention, the library of
different conformationally-restricted probes can be used for
screening a sample for a species which binds to one or more
probes in the library.

According to yet another aspect of the present invention,
methods and devices for the bioelectronic detection of
duplex formation are provided.

According to still another aspect of the invention, an
adhesive is provided which comprises two surfaces of
complementary oligonucleotides.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A to 1F illustrate the preparation of a member of
a library of surface-bound, unimolecular double-stranded
DNA as well as binding studies with receptors having
specificity for either the double stranded DNA portion, a
probe which is held in a conformationally restricted form by
DNA scaffolding, or a bulge or loop region of RNA.

DESCRIPTION OF THE PREFERRED
EMBODIMENT

Abbreviations

The following abbreviations are used herein: phi,
phenanthrenequinone diimine; phen', 5-amido-glutaric acid-1,10-
phenanthroline; dppz, dipyridophenazine.

Glossary

The following terms are intended to have the following
general meanings as they are used herein:

Chemical terms: As used herein, the term "alkyl" refers to
a saturated hydrocarbon radical which may be straight-chain
or branched-chain (for example, ethyl, isopropyl, t-amyl, or
2,5-dimethylhexyl). When "alkyl" or "alkylene" is used to
refer to a linking group or a spacer, it is taken to be a group
having two available valences for covalent attachment, for
example, -CH₂CH₂-, -CH₂CH₂CH₂-,
-CH₂CH₂CH(CH₃)CH₂- and -CH₂(CH₂CH₂)₂CH₂-.
Preferred alkyl groups as substituents are those containing 1
to 10 carbon atoms, with those containing 1 to 6 carbon
atoms being particularly preferred. Preferred alkyl or alkylene
groups as linking groups are those containing 1 to 20
carbon atoms, with those containing 3 to 6 carbon atoms
being particularly preferred. The term "polyethylene glycol"
is used to refer to those molecules which have repeating
units of ethylene glycol, for example, hexaethylene glycol
(HO-(CH₂CH₂O)₅-CH₂CH₂OH). When the term "polyethylene
glycol" is used to refer to linking groups and spacer
groups, it would be understood by one of skill in the art that
other polyethers or polyols could be used as well (i.e.,
polypropylene glycol or mixtures of ethylene and propylene
glycols).

The term "protecting group" as used herein, refers to any
of the groups which are designed to block one reactive site
in a molecule while a chemical reaction is carried out at
another reactive site. More particularly, the protecting
groups used herein can be any of those groups described in
Greene, et al., Protective Groups In Organic Chemistry, 2nd
Ed., John Wiley & Sons, New York, N.Y., 1991, incorporated
herein by reference. The proper selection of protecting
groups for a particular synthesis will be governed by the
overall methods employed in the synthesis. For example, in
"light-directed" synthesis, discussed below, the protecting
groups will be photolabile protecting groups such as NVOC,
MeNPOC, and those disclosed in co-pending Application
PCT/US93/10162 (filed Oct. 22, 1993), incorporated herein
by reference. In other methods, protecting groups may be
removed by chemical methods and include groups such as
FMOC, DMT and others known to those of skill in the art.

Complementary or substantially complementary: Refers
to the hybridization or base pairing between nucleotides or
nucleic acids, such as, for instance, between the two strands
of a double stranded DNA molecule or between an oligonucleotide
primer and a primer binding site on a single
stranded nucleic acid to be sequenced or amplified.
Complementary nucleotides are, generally, A and T (or A and U), or
C and G. Two single stranded RNA or DNA molecules are
said to be substantially complementary when the nucleotides
of one strand, optimally aligned and compared and with
appropriate nucleotide insertions or deletions, pair with at
least about 80% of the nucleotides of the other strand,
usually at least about 90% to 95%, and more preferably from
about 98 to 100%.

Alternatively, substantial complementarity exists when an
RNA or DNA strand will hybridize under selective hybridization
conditions to its complement. Typically, selective
hybridization will occur when there is at least about 65%
complementarity over a stretch of at least 14 to 25 nucleotides,
preferably at least about 75%, more preferably at
least about 90% complementarity. See, M. Kanehisa Nucleic
Acids Res. 12:203 (1984), incorporated herein by reference.
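
The percentage thresholds in this definition can be made concrete with a short calculation. The sketch below is a simplification under stated assumptions, not the procedure of the specification: it compares two equal-length strands antiparallel and ungapped, whereas the definition above also allows optimal alignment with insertions or deletions. The function names are hypothetical, and the 80% default is the "at least about 80%" figure from the text.

```python
# Base pairs treated as complementary: A-T (or A-U) and C-G, in either order.
_PAIRS = {
    ("A", "T"), ("T", "A"),
    ("A", "U"), ("U", "A"),
    ("C", "G"), ("G", "C"),
}


def percent_complementary(strand_a: str, strand_b: str) -> float:
    """Percent of positions that pair when the strands are aligned antiparallel.

    Both strands are written 5'->3'. No gaps are introduced, so this is a
    lower bound on what an optimal gapped alignment would report.
    """
    a = strand_a.upper()
    b = strand_b.upper()[::-1]  # reverse so position i of a faces its partner
    if not a or len(a) != len(b):
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum((x, y) in _PAIRS for x, y in zip(a, b))
    return 100.0 * paired / len(a)


def is_substantially_complementary(strand_a: str, strand_b: str,
                                   threshold: float = 80.0) -> bool:
    """Apply the 'at least about 80%' criterion to the ungapped comparison."""
    return percent_complementary(strand_a, strand_b) >= threshold


if __name__ == "__main__":
    print(percent_complementary("ACGTACGTAC", "GTACGTACGT"))
    print(is_substantially_complementary("ACGTACGTAC", "GTACGTACGA"))
```

Because the comparison is ungapped, a pair of strands that this check rejects could still qualify under the full definition once insertions or deletions are allowed.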

Stringent hybridization conditions will typically include
salt concentrations of less than about 1M, more usually less
than about 500 mM and preferably less than about 200 mM.
Hybridization temperatures can be as low as 5° C., but are
typically greater than 22° C., more typically greater than
about 30° C., and preferably in excess of about 37° C.
Longer fragments may require higher hybridization temperatures
for specific hybridization. As other factors may
affect the stringency of hybridization, including base composition
and length of the complementary strands, presence
of organic solvents and extent of base mismatching, the
combination of parameters is more important than the absolute
measure of any one alone.
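
The salt and temperature ranges just given can be tabulated as a simple lookup. This is a rough sketch only: the function names and tier labels are hypothetical, the "about" in each threshold is treated as an exact cutoff for convenience, and, as the paragraph above stresses, no single parameter determines stringency, so the result should be read as a description of where a condition falls rather than a prediction of hybridization behavior.

```python
def salt_tier(salt_molar: float) -> str:
    """Place a salt concentration (mol/L) in the ranges stated in the text."""
    if salt_molar < 0.2:
        return "preferred"            # less than about 200 mM
    if salt_molar < 0.5:
        return "more usual"           # less than about 500 mM
    if salt_molar < 1.0:
        return "typical"              # less than about 1 M
    return "outside stated range"


def temperature_tier(temp_c: float) -> str:
    """Place a hybridization temperature (deg C) in the stated ranges."""
    if temp_c > 37:
        return "preferred"            # in excess of about 37 C
    if temp_c > 30:
        return "more typical"         # greater than about 30 C
    if temp_c > 22:
        return "typical"              # greater than 22 C
    if temp_c >= 5:
        return "low but possible"     # can be as low as 5 C
    return "outside stated range"


if __name__ == "__main__":
    print(salt_tier(0.15), temperature_tier(42.0))
```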

Epitope: The portion of an antigen molecule which is
delineated by the area of interaction with the subclass of
receptors known as antibodies.

Identifier tag: A means whereby one can identify which
molecules have experienced a particular reaction in the
synthesis of an oligomer. The identifier tag also records the
step in the synthesis series in which the molecules experienced
that particular monomer reaction. The identifier tag
may be any recognizable feature which is, for example:
microscopically distinguishable in shape, size, color, optical
density, etc.; differently absorbing or emitting of light;
chemically reactive; magnetically or electronically encoded;
or in some other way distinctively marked with the required
information. A preferred example of such an identifier tag is
an oligonucleotide sequence.

Ligand/Probe: A ligand is a molecule that is recognized by
a particular receptor. The agent bound by or reacting with a
receptor is called a "ligand," a term which is definitionally
meaningful only in terms of its counterpart receptor. The
term "ligand" does not imply any particular molecular size
or other structural or compositional feature other than that
the substance in question is capable of binding or otherwise
interacting with the receptor. Also, a ligand may serve either
as the natural ligand to which the receptor binds, or as a
functional analogue that may act as an agonist or antagonist.
Examples of ligands that can be investigated by this invention
include, but are not restricted to, agonists and antagonists
for cell membrane receptors, toxins and venoms, viral
epitopes, hormones (e.g., opiates, steroids, etc.), hormone
receptors, peptides, enzymes, enzyme substrates, substrate
analogs, transition state analogs, cofactors, drugs, proteins,
and antibodies. The term "probe" refers to those molecules
which are expected to act like ligands but for which binding
information is typically unknown. For example, if a receptor
is known to bind a ligand which is a peptide β-turn, a
"probe" or library of probes will be those molecules
designed to mimic the peptide β-turn. In instances where the
particular ligand associated with a given receptor is
unknown, the term probe refers to those molecules designed
as potential ligands for the receptor.

Monomer: Any member of the set of molecules which can
be joined together to form an oligomer or polymer. The set
of monomers useful in the present invention includes, but is
not restricted to, for the example of oligonucleotide synthesis,
the set of nucleotides consisting of adenine, thymine,
cytosine, guanine, and uridine (A, T, C, G, and U, respectively)
and synthetic analogs thereof. As used herein, monomers
refers to any member of a basis set for synthesis of an
oligomer. Different basis sets of monomers may be used at
successive steps in the synthesis of a polymer.
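
To illustrate how a basis set determines the sequences that can be built, the sketch below enumerates every oligomer obtainable when one monomer is chosen from a given basis set at each synthesis step. It is only an illustration under assumptions: the function names are hypothetical, monomers are represented as single letters, and nothing here reflects how the arrays are physically synthesized. Passing a different basis set for each step reflects the statement that basis sets may differ between successive steps.

```python
from itertools import product
from typing import Iterator, Sequence

DNA_BASIS = ("A", "C", "G", "T")


def enumerate_oligomers(basis_per_step: Sequence[Sequence[str]]) -> Iterator[str]:
    """Yield every oligomer formed by picking one monomer per synthesis step."""
    for combo in product(*basis_per_step):
        yield "".join(combo)


def library_size(basis_per_step: Sequence[Sequence[str]]) -> int:
    """Number of distinct oligomers: the product of the basis-set sizes."""
    size = 1
    for basis in basis_per_step:
        size *= len(basis)
    return size


if __name__ == "__main__":
    steps = [DNA_BASIS] * 3                    # same basis set at all three steps
    print(library_size(steps))                 # 4 * 4 * 4
    mixed = [("A", "T"), DNA_BASIS]            # a restricted first step
    print(list(enumerate_oligomers(mixed)))
```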

Oligomer or Polymer: The oligomer or polymer
sequences of the present invention are formed from the
chemical or enzymatic addition of monomer subunits. Such
oligomers include, for example, linear, cyclic, and
branched polymers of nucleic acids, polysaccharides,
phospholipids, and peptides having either α-, β-, or ω-amino
acids, heteropolymers in which a known drug is covalently
bound to any of the above, polyurethanes, polyesters,
polycarbonates, polyureas, polyamides, polyethyleneimines,
polyarylene sulfides, polysiloxanes, polyimides, polyacetates,
or other polymers which will be readily apparent to
one skilled in the art upon review of this disclosure. As used
herein, the term oligomer or polymer is meant to include
such molecules as β-turn mimetics, prostaglandins and
benzodiazepines which can also be synthesized in a stepwise
fashion on a solid support.

Peptide: A peptide is an oligomer in which the monomers
are amino acids and which are joined together through
amide bonds and alternatively referred to as a polypeptide.
In the context of this specification it should be appreciated
that when α-amino acids are used, they may be the L-optical
isomer or the D-optical isomer. Other amino acids which are
useful in the present invention include unnatural amino acids
such as β-alanine, phenylglycine, homoarginine and the like.
Peptides are more than two amino acid monomers long, and
often more than 20 amino acid monomers long. Standard
abbreviations for amino acids are used (e.g., P for proline).
These abbreviations are included in Stryer, Biochemistry,
Third Ed., (1988), which is incorporated herein by reference
for all purposes.

Oligonucleotides: An oligonucleotide is a single-stranded
DNA or RNA molecule, typically prepared by synthetic
means. Alternatively, naturally occurring oligonucleotides,
or fragments thereof, may be isolated from their natural
sources or purchased from commercial sources. Those
oligonucleotides employed in the present invention will be 4 to
100 nucleotides in length, preferably from 6 to 30 nucleotides,
although oligonucleotides of different length may be
appropriate. Suitable oligonucleotides may be prepared by
the phosphoramidite method described by Beaucage and
Caruthers, Tetrahedron Lett., 22:1859-1862 (1981), or by
the triester method according to Matteucci, et al., J. Am.
Chem. Soc., 103:3185 (1981), both incorporated herein by
reference, or by other chemical methods using either a
commercial automated oligonucleotide synthesizer or
VLSIPS™ technology (discussed in detail below). When
oligonucleotides are referred to as "double-stranded," it is
understood by those of skill in the art that a pair of
oligonucleotides exist in a hydrogen-bonded, helical array
typically associated with, for example, DNA. In addition to
the 100% complementary form of double-stranded
oligonucleotides, the term "double-stranded" as used herein is
also meant to refer to those forms which include such
structural features as bulges and loops, described more fully
in such biochemistry texts as Stryer, Biochemistry, Third
Ed., (1988), previously incorporated herein by reference for
all purposes.
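
The length ranges in this definition (4 to 100 nucleotides, preferably 6 to 30) can be expressed as a small classifier. This is a minimal sketch with a hypothetical function name and hypothetical labels; because the text notes that other lengths may also be appropriate, the last label describes a length rather than rejecting it.

```python
def classify_oligo_length(n_nucleotides: int) -> str:
    """Place an oligonucleotide length in the ranges given in the definition."""
    if n_nucleotides < 1:
        raise ValueError("length must be a positive number of nucleotides")
    if 6 <= n_nucleotides <= 30:
        return "preferred (6 to 30)"
    if 4 <= n_nucleotides <= 100:
        return "within stated range (4 to 100)"
    return "outside stated range (may still be appropriate)"


if __name__ == "__main__":
    for n in (3, 4, 20, 30, 31, 100, 150):
        print(n, classify_oligo_length(n))
```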

Receptor: A molecule that has an affinity for a given
ligand or probe. Receptors may be naturally-occurring or
man made molecules. Also, they can be employed in their
unaltered natural or isolated state or as aggregates with other
species. Receptors may be attached, covalently or
noncovalently, to a binding member, either directly or via a
specific binding substance. Examples of receptors which can
be employed by this invention include, but are not restricted
to, antibodies, cell membrane receptors, monoclonal antibodies
and antisera reactive with specific antigenic determinants
(such as on viruses, cells or other materials), drugs,
polynucleotides, nucleic acids, peptides, cofactors, lectins,
sugars, polysaccharides, cells, cellular membranes, and
organelles. Receptors are sometimes referred to in the art as
anti-ligands. As the term receptors is used herein, no difference
in meaning is intended. A "ligand-receptor pair" is
formed when two molecules have combined through
molecular recognition to form a complex. Other examples of
receptors which can be investigated by this invention
include but are not restricted to:

a) Microorganism receptors: Determination of ligands or
probes that bind to receptors, such as specific transport
proteins or enzymes essential to survival of microorganisms,
is useful in developing a new class of antibiotics. Of
particular value would be antibiotics against opportunistic
fungi, protozoa, and those bacteria resistant to the
antibiotics in current use.

b) Enzymes: For instance, the binding site of enzymes
such as the enzymes responsible for cleaving
neurotransmitters. Determination of ligands or probes that
bind to certain receptors, and thus modulate the action
of the enzymes that cleave the different neurotransmitters,
is useful in the development of drugs that can be
used in the treatment of disorders of neurotransmission.

c) Antibodies: For instance, the invention may be useful
in investigating the ligand-binding site on the antibody
molecule which combines with the epitope of an antigen
of interest. Determining a sequence that mimics an
antigenic epitope may lead to the development of
vaccines of which the immunogen is based on one or
more of such sequences, or lead to the development of
related diagnostic agents or compounds useful in
therapeutic treatments such as for autoimmune diseases
(e.g., by blocking the binding of the "self" antibodies).

d) Nucleic Acids: The invention may be useful in
investigating sequences of nucleic acids acting as binding
sites for cellular proteins ("trans-acting factors"). Such
sequences may include, e.g., transcription factors,
suppressors, enhancers or promoter sequences.

e) Catalytic Polypeptides: Polymers, preferably polypeptides,
which are capable of promoting a chemical
reaction involving the conversion of one or more
reactants to one or more products. Such polypeptides
generally include a binding site specific for at least one
reactant or reaction intermediate and an active functionality
proximate to the binding site, which functionality
is capable of chemically modifying the bound
reactant. Catalytic polypeptides are described in
Lerner, R. A. et al., Science 252: 659 (1991), which is
incorporated herein by reference.

f) Hormone receptors: For instance, the receptors for
insulin and growth hormone. Determination of the
ligands which bind with high affinity to a receptor is
useful in the development of, for example, an oral
replacement of the daily injections which diabetics
must take to relieve the symptoms of diabetes, and in
the other case, a replacement for the scarce human
growth hormone that can only be obtained from cadavers
or by recombinant DNA technology. Other
examples are the vasoconstrictive hormone receptors;
determination of those ligands that bind to a receptor
may lead to the development of drugs to control blood
pressure.

g) Opiate receptors: Determination of ligands that bind to
the opiate receptors in the brain is useful in the development
of less-addictive replacements for morphine
and related drugs.

Substrate or Solid Support: A material having a rigid or 
semi-rigid surface. Such materials will preferably take the 
form of plates or slides, small beads, pellets, disks or other 
convenient forms, although other forms may be used. In 
some embodiments, at least one surface of the substrate will 
be substantially flat. In other embodiments, a roughly spheri- 
cal shape is preferred. 

Synthetic: Produced by in vitro chemical or enzymatic 
synthesis. The synthetic libraries of the present invention 
may be contrasted with those in viral or plasmid vectors, for 
instance, which may be propagated in bacterial, yeast, or 
other living hosts. 



DESCRIPTION OF THE INVENTION 

The broad concept of the present invention is illustrated in
FIGS. 1A to 1F. FIGS. 1A, 1B and 1C illustrate the preparation
of surface-bound unimolecular double stranded DNA,
while FIGS. 1D, 1E, and 1F illustrate uses for the libraries
of the present invention.

FIG. 1A shows a solid support 1 having an attached spacer
2, which is optional. Attached to the distal end of the spacer
is a first oligomer 3, which can be attached as a single unit
or synthesized on the support or spacer in a monomer by
monomer approach. FIG. 1B shows a subsequent stage in
the preparation of one member of a library according to the
present invention. In this stage, a flexible linker 4 is attached
to the distal end of the oligomer 3. In other embodiments, the
flexible linker will be a probe. FIG. 1C shows the completed
surface-bound unimolecular double stranded DNA which is
one member of a library, wherein a second oligomer 5 is now
attached to the distal end of the flexible linker (or probe). As
shown in FIG. 1C, the length of the flexible linker (or probe)
4 is sufficient such that the first and second oligomers (which
are complementary) exist in a double-stranded conformation.
It will be appreciated by one of skill in the art that the
libraries of the present invention will contain multiple,
individually synthesized members which can be screened for
various types of activity. Three such binding events are
illustrated in FIGS. 1D, 1E and 1F.

In FIG. 1D, a receptor 6, which can be a protein, RNA
molecule or other molecule which is known to bind to DNA,
is introduced to the library. Determining which member of
a library binds to the receptor provides information which is
useful for diagnosing diseases, sequencing DNA or RNA,
identifying genetic characteristics, or in drug discovery.

In FIG. 1E, the linker 4 is a probe for which binding
information is sought. The probe is held in a conformationally
restricted manner by the flanking oligomers 3 and 5,
which are present in a double-stranded conformation. As a
result, a library of conformationally restricted probes can be
screened for binding activity with a receptor 7 which has
specificity for the probe.

The present invention also contemplates the preparation
of libraries of unimolecular, double-stranded oligonucleotides
having bulges or loops in one of the strands as
depicted in FIG. 1F. In FIG. 1F, one oligonucleotide 5 is
shown as having a bulge 8. Specific RNA bulges are often
recognized by proteins (e.g., TAR RNA is recognized by the
TAT protein of HIV). Accordingly, libraries of RNA bulges
or loops are useful in a number of diagnostic applications.
One of skill in the art will appreciate that the bulge or loop
can be present in either oligonucleotide portion 3 or 5.
Libraries of Unimolecular, Double-Stranded Oligonucleotides

In one aspect, the present invention provides libraries of
unimolecular double-stranded oligonucleotides, each member
of the library having the formula:

Y-L¹-X¹-L²-X²

in which Y represents a solid support, X¹ and X² represent
a pair of complementary oligonucleotides, L¹ represents a
bond or a spacer, and L² represents a linking group having
sufficient length such that X¹ and X² form a double-stranded
oligonucleotide.

The solid support may be biological, nonbiological,
organic, inorganic, or a combination of any of these, existing
as particles, strands, precipitates, gels, sheets, tubing,
spheres, containers, capillaries, pads, slices, films, plates,
slides, etc. The solid support is preferably flat but may take
on alternative surface configurations. For example, the solid
support may contain raised or depressed regions on which
synthesis takes place. In some embodiments, the solid
support will be chosen to provide appropriate light-absorbing
characteristics. For example, the support may be a
polymerized Langmuir Blodgett film, functionalized glass,
Si, Ge, GaAs, GaP, SiO₂, SiN₄, modified silicon, or any one
of a variety of gels or polymers such as (poly)tetrafluoroethylene,
(poly)vinylidenedifluoride, polystyrene, polycarbonate,
or combinations thereof. Other suitable solid support
materials will be readily apparent to those of skill in the art.
Preferably, the surface of the solid support will contain
reactive groups, which could be carboxyl, amino, hydroxyl,
thiol, or the like. More preferably, the surface will be
optically transparent and will have surface Si-OH
functionalities, such as are found on silica surfaces.

Attached to the solid support is an optional spacer, L¹. The
spacer molecules are preferably of sufficient length to permit
the double-stranded oligonucleotides in the completed member
of the library to interact freely with molecules exposed
to the library. The spacer molecules, when present, are
typically 6-50 atoms long to provide sufficient exposure for
the attached double-stranded DNA molecule. The spacer, L¹,
is comprised of a surface attaching portion and a longer
chain portion. The surface attaching portion is that part of L¹
which is directly attached to the solid support. This portion
can be attached to the solid support via carbon-carbon bonds
using, for example, supports having (poly)trifluorochloroethylene
surfaces, or preferably, by siloxane bonds (using,
for example, glass or silicon oxide as the solid support).
Siloxane bonds with the surface of the support are formed in
one embodiment via reactions of surface attaching portions
bearing trichlorosilyl or trialkoxysilyl groups. The surface
attaching groups will also have a site for attachment of the
longer chain portion. For example, groups which are suitable
for attachment to a longer chain portion would include
amines, hydroxyl, thiol, and carboxyl. Preferred surface
attaching portions include aminoalkylsilanes and
hydroxyalkylsilanes. In particularly preferred embodiments, the
surface attaching portion of L¹ is either bis(2-hydroxyethyl)-
aminopropyltriethoxysilane,
2-hydroxyethylaminopropyltriethoxysilane,
aminopropyltriethoxysilane or hydroxypropyltriethoxysilane.

The longer chain portion can be any of a variety of
molecules which are inert to the subsequent conditions for
polymer synthesis. These longer chain portions will typically
be aryl acetylene, ethylene glycol oligomers containing
2-14 monomer units, diamines, diacids, amino acids, peptides,
or combinations thereof. In some embodiments, the
longer chain portion is a polynucleotide. The longer chain
portion which is to be used as part of L¹ can be selected
based upon its hydrophilic/hydrophobic properties to
improve presentation of the double-stranded oligonucleotides
to certain receptors, proteins or drugs. The longer
chain portion of L¹ can be constructed of polyethyleneglycols,
polynucleotides, alkylene, polyalcohol, polyester,
polyamine, polyphosphodiester and combinations thereof.
Additionally, for use in synthesis of the libraries of the
invention, L¹ will typically have a protecting group, attached
to a functional group (i.e., hydroxyl, amino or carboxylic
acid) on the distal or terminal end of the chain portion
(opposite the solid support). After deprotection and coupling,
the distal end is covalently bound to an oligomer.

Attached to the distal end of L¹ is an oligonucleotide, X¹,
which is a single-stranded DNA or RNA molecule. The
oligonucleotides which are part of the present invention are
typically of from about 4 to about 100 nucleotides in length.
Preferably, X¹ is an oligonucleotide which is about 6 to
about 30 nucleotides in length. The oligonucleotide is typically
linked to L¹ via the 3'-hydroxyl group of the oligonucleotide
and a functional group on L¹ which results in the
formation of an ether, ester, carbamate or phosphate ester
linkage.

Attached to the distal end of X¹ is a linking group, L²,
which is flexible and of sufficient length that X¹ can effectively
hybridize with X². The length of the linker will
typically be a length which is at least the length spanned by
two nucleotide monomers, and preferably at least four
nucleotide monomers, while not being so long as to interfere
with either the pairing of X¹ and X² or any subsequent
assays. The linking group itself will typically be an alkylene
group (of from about 6 to about 24 carbons in length), a
polyethyleneglycol group (of from about 2 to about 24
ethyleneglycol monomers in a linear configuration), a
polyalcohol group, a polyamine group (e.g., spermine, spermidine
and polymeric derivatives thereof), a polyester group
(e.g., poly(ethyl acrylate) having of from 3 to 15 ethyl
acrylate monomers in a linear configuration), a
polyphosphodiester group, or a polynucleotide (having from about 2
to about 12 nucleic acids). Preferably, the linking group will
be a polyethyleneglycol group which is at least a
tetraethyleneglycol, and more preferably, from about 1 to 4
hexaethyleneglycols linked in a linear array. For use in synthesis
of the compounds of the invention, the linking group will be
provided with functional groups which can be suitably
protected or activated. The linking group will be covalently
attached to each of the complementary oligonucleotides, X¹
and X², by means of an ether, ester, carbamate, phosphate
ester or amine linkage. The flexible linking group L² will be
attached to the 5'-hydroxyl of the terminal monomer of X¹
and to the 3'-hydroxyl of the initial monomer of X².
Preferred linkages are phosphate ester linkages which can be
formed in the same manner as the oligonucleotide linkages
which are present in X¹ and X². For example,
hexaethyleneglycol can be protected on one terminus with a
photolabile protecting group (i.e., NVOC or MeNPOC) and
activated on the other terminus with
2-cyanoethyl-N,N-diisopropylaminochlorophosphite to form a
phosphoramidite. This linking group can then be used for
construction of the libraries in the same manner as the
photolabile-protected, phosphoramidite-activated nucleotides.
Alternatively, ester linkages to X¹ and X² can be formed
when L² has terminal carboxylic acid moieties (using the
5'-hydroxyl of X¹ and the 3'-hydroxyl of X²). Other methods
of forming ether, carbamate or amine linkages are known to
those of skill in the art, and particular reagents and references
can be found in such texts as March, Advanced Organic Chemistry,
4th Ed., Wiley-Interscience, New York, N.Y., 1992,
incorporated herein by reference.

The oligonucleotide, X², which is covalently attached to
the distal end of the linking group is, like X¹, a
single-stranded DNA or RNA molecule. The oligonucleotides
which are part of the present invention are typically of from
about 4 to about 100 nucleotides in length. Preferably, X² is
an oligonucleotide which is about 6 to about 30 nucleotides
in length and exhibits complementarity to X¹ of from 90 to
100%. More preferably, X¹ and X² are 100% complementary.
In one group of embodiments, either X¹ or X² will
further comprise a bulge or loop portion and exhibit
complementarity of from 90 to 100% over the remainder of the
oligonucleotide.
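
By way of illustration only, the 90 to 100% complementarity criterion can be checked computationally. The following minimal Python sketch assumes DNA sequences written 5' to 3', equal lengths for X¹ and X², and Watson-Crick pairing only; the function names are hypothetical and bulge or loop portions are not modeled.

```python
# Illustrative sketch: percent complementarity of X2 to X1.
# Assumptions (not from the specification): DNA alphabet, equal lengths,
# both strands written 5'->3', no bulges or loops.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the strand that pairs perfectly with seq, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def percent_complementarity(x1, x2):
    """Percentage of positions at which x2 pairs with x1 (antiparallel)."""
    if len(x1) != len(x2):
        raise ValueError("this sketch only compares strands of equal length")
    ideal = reverse_complement(x1)
    matches = sum(a == b for a, b in zip(ideal, x2))
    return 100.0 * matches / len(x1)


x1 = "ACGTACGTAC"
perfect = reverse_complement(x1)        # fully complementary X2
one_mismatch = perfect[:-1] + "A"       # last base altered
print(percent_complementarity(x1, perfect))
print(percent_complementarity(x1, one_mismatch))
```

With a 10-mer, a single mismatch corresponds to the lower bound of the preferred range.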

In a particularly preferred embodiment, the solid support
is a silica support, the spacer is a polyethyleneglycol
conjugated to an aminoalkylsilane, the linking group is a
polyethyleneglycol group, and X¹ and X² are complementary
oligonucleotides each comprising of from 6 to 30
nucleic acid monomers.

The library can have virtually any number of different
members, and will be limited only by the number or variety
of compounds desired to be screened in a given application
and by the synthetic capabilities of the practitioner. In one
group of embodiments, the library will have from 2 up to
100 members. In other groups of embodiments, the library
will have between 100 and 10000 members, and between
10000 and 1000000 members, preferably on a solid support.
In preferred embodiments, the library will have a density of
more than 100 members at known locations per cm², preferably
more than 1000 per cm², more preferably more than
10,000 per cm².
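
To give a sense of scale for these densities, the following short Python sketch converts a density in members per cm² into a center-to-center spacing. It assumes, purely for illustration, that the members are laid out on a regular square grid; the function name is hypothetical.

```python
import math


def feature_pitch_um(density_per_cm2):
    """Center-to-center spacing in micrometers for a square grid of members.

    Assumption: members are arranged on a regular square grid, so each
    member occupies 1/density cm^2 (1 cm = 10,000 micrometers).
    """
    return 1e4 / math.sqrt(density_per_cm2)


for density in (100, 1000, 10000):
    print(density, "per cm^2 ->", round(feature_pitch_um(density), 1), "um")
```

Under that assumption, 100 members per cm² corresponds to a spacing of 1000 µm and 10,000 members per cm² to a spacing of 100 µm.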

Libraries of Conformationally Restricted Probes

In still another aspect, the present invention provides
libraries of conformationally-restricted probes. Each of the
members of the library comprises a solid support having an
optional spacer which is attached to an oligomer of the
formula:

-X¹¹-Z-X¹²

in which X¹¹ and X¹² are complementary oligonucleotides
and Z is a probe. The probe will have sufficient length such
that X¹¹ and X¹² form a double-stranded DNA portion of
each member. X¹¹ and X¹² are as described above for X¹ and
X² respectively, except that for the present aspect of the
invention, each member of the probe library can have the
same X¹¹ and the same X¹², and differ only in the probe
portion. In one group of embodiments, X¹¹ and X¹² are
either a poly-A oligonucleotide or a poly-T oligonucleotide.

As noted above, each member of the library will typically
have a different probe portion. The probes, Z, can be any of
a variety of structures for which receptor-probe binding
information is sought for conformationally-restricted forms.
For example, the probe can be an agonist or antagonist for
a cell membrane receptor, a toxin, venom, viral epitope,
hormone, peptide, enzyme cofactor, drug, protein or antibody.
In one group of embodiments, the probes are different
peptides, each having of from about 4 to about 12 amino
acids. Preferably the probes will be linked via polyphosphate
diesters, although other linkages are also suitable. For
example, the last monomer employed on the X¹¹ chain can
be a 5'-aminopropyl-functionalized phosphoramidite nucleotide
(available from Glen Research, Sterling, Va., USA or
Genosys Biotechnologies, The Woodlands, Tex., USA)
which will provide a synthesis initiation site for the carboxy
to amino synthesis of the peptide probe. Once the peptide
probe is formed, a 3'-succinylated nucleoside (from Cruachem,
Sterling, Va., USA) will be added under peptide
coupling conditions. In yet another group of embodiments,
the probes will be oligonucleotides of from 4 to about 30
nucleic acid monomers which will form a DNA or RNA
hairpin structure. For use in synthesis, the probes can also
have associated functional groups (i.e., hydroxyl, amino,
carboxylic acid, anhydride and derivatives thereof) for
attaching two positions on the probe to each of the
complementary oligonucleotides.

The surface of the solid support is preferably provided
with a spacer molecule, although it will be understood that
the spacer molecules are not elements of this aspect of the
invention. Where present, the spacer molecules will be as
described above for L¹.

The libraries of conformationally-restricted probes can
also have virtually any number of members. As above, the
number of members will be limited only by design of the
particular screening assay for which the library will be used,
and by the synthetic capabilities of the practitioner. In one
group of embodiments, the library will have from 2 to 100
members. In other groups of embodiments, the library will
have between 100 and 10000 members, and between 10000
and 1000000 members. Also as above, in preferred embodiments,
the library will have a density of more than 100
members at known locations per cm², preferably more than
1000 per cm², more preferably more than 10,000 per cm².

Preparation of the Libraries

The present invention further provides methods for the
preparation of diverse unimolecular, double-stranded
oligonucleotides on a solid support. In one group of embodiments,
the surface of a solid support has a plurality of
preselected regions. An oligonucleotide of from 6 to 30
monomers is formed on each of the preselected regions. A
linking group is then attached to the distal end of each of the
oligonucleotides. Finally, a second oligonucleotide is
formed on the distal end of each linking group such that the
second oligonucleotide is complementary to the oligonucleotide
already present in the same preselected region. The
linking group used will have sufficient length such that the
complementary oligonucleotides form a unimolecular,
double-stranded oligonucleotide. In another group of
embodiments, each chemically distinct member of the
library will be synthesized on a separate solid support.

Libraries on a Single Substrate
Light-Directed Methods 

For those embodiments using a single solid support, the
oligonucleotides of the present invention can be formed
using a variety of techniques known to those skilled in the
art of polymer synthesis on solid supports. For example,
"light directed" methods (which are one technique in a
family of methods known as VLSIPS™ methods) are
described in U.S. Pat. No. 5,143,854, previously incorporated
by reference. The light directed methods discussed in
the '854 patent involve activating predefined regions of a
substrate or solid support and then contacting the substrate
with a preselected monomer solution. The predefined
regions can be activated with a light source, typically shone
through a mask (much in the manner of photolithography
techniques used in integrated circuit fabrication). Other
regions of the substrate remain inactive because they are
blocked by the mask from illumination and remain chemically
protected. Thus, a light pattern defines which regions
of the substrate react with a given monomer. By repeatedly
activating different sets of predefined regions and contacting
different monomer solutions with the substrate, a diverse
array of polymers is produced on the substrate. Of course,
other steps such as washing unreacted monomer solution
from the substrate can be used as necessary. Other techniques
include mechanical techniques such as those
described in PCT No. 92/10183 and U.S. Pat. No. 5,384,261,
also incorporated herein by reference for all purposes. Still
further techniques include bead based techniques such as
those described in PCT US/93/04145, also incorporated
herein by reference, and pin based methods such as those
described in U.S. Pat. No. 5,288,514, also incorporated
herein by reference.
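
The repeated activate-and-couple logic of the light directed methods can be pictured with the following minimal Python sketch. It is a schematic model only: regions are numbered positions, a mask is the set of regions that receive light, and coupling is assumed to be complete in every illuminated region. The function name and the example masks are hypothetical.

```python
def light_directed_synthesis(n_regions, cycles):
    """Simulate mask-directed coupling on a substrate with n_regions sites.

    cycles is a list of (mask, monomer) pairs; mask is the set of region
    indices illuminated (deprotected) before the monomer is applied.
    Each returned string lists monomers in order of coupling, i.e. from
    the surface outward. Assumption: coupling is 100% efficient.
    """
    array = [""] * n_regions
    for mask, monomer in cycles:
        for region in mask:
            # Only illuminated regions are deprotected and can react.
            array[region] += monomer
    return array


cycles = [
    ({0, 1}, "A"),  # first mask exposes regions 0 and 1
    ({2, 3}, "C"),  # second mask exposes regions 2 and 3
    ({0, 2}, "G"),
    ({1, 3}, "T"),
]
print(light_directed_synthesis(4, cycles))
```

In this example four coupling cycles yield four different dimers, one per region, because each mask selects a different subset of regions.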

The VLSIPS™ methods are preferred for making the
compounds and libraries of the present invention. The
surface of a solid support, optionally modified with spacers
having photolabile protecting groups such as NVOC and
MeNPOC, is illuminated through a photolithographic mask,
yielding reactive groups (typically hydroxyl groups) in the
illuminated regions. A 3'-O-phosphoramidite activated
deoxynucleoside (protected at the 5'-hydroxyl with a
photolabile protecting group) is then presented to the surface
and chemical coupling occurs at sites that were exposed to
light. Following capping and oxidation, the substrate is
rinsed and the surface illuminated through a second mask, to
expose additional hydroxyl groups for coupling. A second
5'-protected, 3'-O-phosphoramidite activated deoxynucleoside
is presented to the surface. The selective photodeprotection
and coupling cycles are repeated until the desired set
of oligonucleotides is produced. Alternatively, an oligomer
of from, for example, 4 to 30 nucleotides can be added to
each of the preselected regions rather than synthesizing each
member in a monomer by monomer approach. At this point
in the synthesis, either a flexible linking group or a probe can
be attached in a similar manner. For example, a flexible
linking group such as polyethylene glycol will typically
have an activating group (i.e., a phosphoramidite) on one
end and a photolabile protecting group attached to the other
end. Suitably derivatized polyethylene glycol linking groups
can be prepared by the methods described in Durand, et al.,
Nucleic Acids Res. 18:6353-6359 (1990). Briefly, a
polyethylene glycol (i.e., hexaethylene glycol) can be
mono-protected using MeNPOC-chloride. Following purification
of the mono-protected glycol, the remaining hydroxy moiety
can be activated with
2-cyanoethyl-N,N-diisopropylaminochlorophosphite. Once the
flexible linking group has been
attached to the first oligonucleotide (X¹), deprotection and
coupling cycles will proceed using 5'-protected,
3'-O-phosphoramidite activated deoxynucleosides or intact oligomers.
Probes can be attached in a manner similar to that used for
the flexible linking group. When the desired probe is itself
an oligomer, it can be formed either in stepwise fashion on
the immobilized oligonucleotide or it can be separately
synthesized and coupled to the immobilized oligomer in a
single step. For example, preparation of conformationally
restricted β-turn mimetics will typically involve synthesis of
an oligonucleotide as described above, in which the last
nucleoside monomer will be derivatized with an
aminoalkyl-functionalized phosphoramidite. See, U.S. Pat. No.
5,288,514, previously incorporated by reference. The desired
peptide probe is typically formed in the direction from
carboxyl to amine terminus. Subsequent coupling of a
3'-succinylated nucleoside, for example, provides the first
monomer in the construction of the complementary
oligonucleotide strand (which is carried out by the above methods).
Alternatively, a library of probes can be prepared by
first derivatizing a solid support with multiple poly(A) or
poly(T) oligonucleotides which are suitably protected with
photolabile protecting groups, deprotecting at known sites
and constructing the probe at those sites, then coupling the
complementary poly(T) or poly(A) oligonucleotide.
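
As a rough illustration of the order in which building blocks are coupled for a single unimolecular, double-stranded member, the following Python sketch lists the couplings from the surface outward. It assumes DNA sequences written 5' to 3', growth of each strand from its 3' end (consistent with 5'-protected, 3'-O-phosphoramidite activated monomers), and a perfectly complementary second strand; the function names and the "PEG" label for the linking group are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the perfectly complementary strand, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def coupling_order(x1, linker="PEG"):
    """List building blocks in the order they are coupled, surface outward.

    x1 is the first oligonucleotide written 5'->3'. Assumption: both
    strands grow from their 3' ends, so each is coupled in reverse of
    its written order, with the linking group added between them.
    """
    x2 = reverse_complement(x1)
    steps = list(reversed(x1))   # first strand, 3' end first
    steps.append(linker)         # flexible linking group
    steps.extend(reversed(x2))   # second strand, 3' end first
    return steps


print(coupling_order("AAG"))
```

For the first strand 5'-AAG-3' the couplings are G, A, A, linker, T, T, C, so the monomers coupled after the linking group mirror, as complements, those coupled before it.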

Flow Channel or Spotting Methods

Additional methods applicable to library synthesis on a
single substrate are described in co-pending applications
Ser. No. 07/980,523, filed Nov. 20, 1992, and U.S. Pat. No.
5,384,261, incorporated herein by reference for all purposes.
In the methods disclosed in these applications, reagents are
delivered to the substrate by either (1) flowing within a
channel defined on predefined regions or (2) "spotting" on
predefined regions. However, other approaches, as well as
combinations of spotting and flowing, may be employed. In
each instance, certain activated regions of the substrate are
mechanically separated from other regions when the monomer
solutions are delivered to the various reaction sites.

A typical "flow channel" method applied to the compounds
and libraries of the present invention can generally
be described as follows. Diverse polymer sequences are
synthesized at selected regions of a substrate or solid support
by forming flow channels on a surface of the substrate
through which appropriate reagents flow or in which appropriate
reagents are placed. For example, assume a monomer
"A" is to be bound to the substrate in a first group of selected
regions. If necessary, all or part of the surface of the
substrate in all or a part of the selected regions is activated
for binding by, for example, flowing appropriate reagents
through all or some of the channels, or by washing the entire
substrate with appropriate reagents. After placement of a
channel block on the surface of the substrate, a reagent
having the monomer A flows through or is placed in all or
some of the channel(s). The channels provide fluid contact
to the first selected regions, thereby binding the monomer A
on the substrate directly or indirectly (via a spacer) in the
first selected regions.

Thereafter, a monomer B is coupled to second selected
regions, some of which may be included among the first
selected regions. The second selected regions will be in fluid
contact with a second flow channel(s) through translation,
rotation, or replacement of the channel block on the surface
of the substrate; through opening or closing a selected valve;
or through deposition of a layer of chemical or photoresist.
If necessary, a step is performed for activating at least the
second regions. Thereafter, the monomer B is flowed
through or placed in the second flow channel(s), binding
monomer B at the second selected locations. In this particular
example, the resulting sequences bound to the substrate
at this stage of processing will be, for example, A, B, and
AB. The process is repeated to form a vast array of
sequences of desired length at known locations on the
substrate.
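
The A, B, and AB outcome described above can be reproduced with a small Python model. The sketch below assumes, for illustration only, a 2 x 2 grid of reaction regions, a first channel that runs along one row, and a second channel (after rotation of the channel block) that runs along one column; the function name and grid layout are hypothetical.

```python
def flow(grid, monomer, rows=(), cols=()):
    """Couple monomer to every region in fluid contact with a channel.

    grid is a list of rows, each a list of strings recording the
    monomers coupled so far (surface outward). rows and cols give the
    indices of the row-wise and column-wise channels carrying reagent.
    Assumption: every contacted region is activated and couples fully.
    """
    for r, row in enumerate(grid):
        for c in range(len(row)):
            if r in rows or c in cols:
                row[c] += monomer


grid = [["", ""], ["", ""]]
flow(grid, "A", rows=[0])   # channel along the first row
flow(grid, "B", cols=[0])   # block rotated: channel along the first column
print(grid)
```

The region where the two channels cross carries AB, the remaining region of the first row carries A, the remaining region of the first column carries B, and the fourth region is unreacted.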

After the substrate is activated, monomer A can be flowed 
through some of the channels, monomer B can be flowed 
through other channels, a monomer C can be flowed through 
still other channels, etc. In this manner, many or all of the 
reaction regions are reacted with a monomer before the
channel block must be moved or the substrate must be 
washed and/or reactivated. By making use of many or all of 
the available reaction regions simultaneously, the number of 
washing and activation steps can be minimized. 

One of skill in the art will recognize that there are
alternative methods of forming channels or otherwise protecting
a portion of the surface of the substrate. For example,
according to some embodiments, a protective coating such
as a hydrophilic or hydrophobic coating (depending upon
the nature of the solvent) is utilized over portions of the
substrate to be protected, sometimes in combination with
materials that facilitate wetting by the reactant solution in 
other regions. In this manner, the flowing solutions are 
further prevented from passing outside of their designated 
flow paths. 

The "spotting" methods of preparing compounds and 
libraries of the present invention can be implemented in 
much the same manner as the flow channel methods. For 
example, a monomer A can be delivered to and coupled with 
a first group of reaction regions which have been appropri-
ately activated. Thereafter, a monomer B can be delivered to 
and reacted with a second group of activated reaction 
regions. Unlike the flow channel embodiments described 
above, reactants are delivered by directly depositing (rather 
than flowing) relatively small quantities of them in selected 
regions. In some steps, of course, the entire substrate surface 
can be sprayed or otherwise coated with a solution. In 
preferred embodiments, a dispenser moves from region to 
region, depositing only as much monomer as necessary at 
each stop. Typical dispensers include a micropipette to
deliver the monomer solution to the substrate and a robotic
system to control the position of the micropipette with
respect to the substrate, or an ink-jet printer. In other
embodiments, the dispenser includes a series of tubes, a
manifold, an array of pipettes, or the like so that various 
reagents can be delivered to the reaction regions simulta- 
neously. 

Pin-Based Methods 

Another method which is useful for the preparation of 
compounds and libraries of the present invention involves 
"pin based synthesis." This method is described in detail in 
U.S. Pat. No. 5,288,514, previously incorporated herein by
reference. The method utilizes a substrate having a plurality
of pins or other extensions. The pins are each inserted
simultaneously into individual reagent containers in a tray. 
In a common embodiment, an array of 96 pins/containers is 
utilized. 

Each tray is filled with a particular reagent for coupling in 
a particular chemical reaction on an individual pin. Accord- 
ingly, the trays will often contain different reagents. Since 
the chemistry disclosed herein has been established such that 
a relatively similar set of reaction conditions may be utilized 
to perform each of the reactions, it becomes possible to 
conduct multiple chemical coupling steps simultaneously. In 
the first step of the process the invention provides for the use 
of substrate(s) on which the chemical coupling steps are 
conducted. The substrate is optionally provided with a
spacer having active sites. In the particular case of
oligonucleotides, for example, the spacer may be selected from a
wide variety of molecules which can be used in organic
environments associated with synthesis as well as aqueous
environments associated with binding studies. Examples of
suitable spacers are polyethyleneglycols, dicarboxylic acids,
polyamines and alkylenes, substituted with, for example,
methoxy and ethoxy groups. Additionally, the spacers will
have an active site on the distal end. The active sites are
optionally protected initially by protecting groups. Among a
wide variety of protecting groups which are useful are
FMOC, BOC, t-butyl esters, t-butyl ethers, and the like.
Various exemplary protecting groups are described in, for
example, Atherton et al., Solid Phase Peptide Synthesis, IRL
Press (1989), incorporated herein by reference. In some
embodiments, the spacer may provide for a cleavable function
by way of, for example, exposure to acid or base.
Libraries on Multiple Substrates 
Bead Based Methods 

Yet another method which is useful for synthesis of
compounds and libraries of the present invention involves
"bead based synthesis." A general approach for bead based
synthesis is described in copending application Ser. Nos.
07/762,522 (filed Sep. 18, 1991, now abandoned);
07/946,239 (filed Sep. 16, 1992); 08/146,886 (filed Nov. 2, 1993);
07/876,792 (filed Apr. 29, 1992) and PCT/US93/04145
(filed Apr. 28, 1993), the disclosures of which are
incorporated herein by reference.

For the synthesis of molecules such as oligonucleotides
on beads, a large plurality of beads are suspended in a
suitable carrier (such as water) in a container. The beads are
provided with optional spacer molecules having an active
site. The active site is protected by an optional protecting
group.

In a first step of the synthesis, the beads are divided for
coupling into a plurality of containers. For the purposes of
this brief description, the number of containers will be
limited to three, and the monomers denoted as A, B, C, D,
E, and F. The protecting groups are then removed and a first
portion of the molecule to be synthesized is added to each of
the three containers (i.e., A is added to container 1, B is
added to container 2 and C is added to container 3).

Thereafter, the various beads are appropriately washed of
excess reagents, and remixed in one container. Again, it will
be recognized that by virtue of the large number of beads
utilized at the outset, there will similarly be a large number
of beads randomly dispersed in the container, each having a
particular first portion of the monomer to be synthesized on
a surface thereof.

Thereafter, the various beads are again divided for coupling
in another group of three containers. The beads in the
first container are deprotected and exposed to a second
monomer (D), while the beads in the second and third
containers are coupled to molecule portions E and F respectively.
Accordingly, molecules AD, BD, and CD will be
present in the first container, while AE, BE, and CE will be
present in the second container, and molecules AF, BF, and
CF will be present in the third container. Each bead, however,
will have only a single type of molecule on its surface.
Thus, all of the possible molecules formed from the first
portions A, B, C, and the second portions D, E, and F have
been formed.
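
The divide, couple and remix cycle just described can be simulated with the following Python sketch. It is a simplified model under stated assumptions: each bead is represented by the string of monomers coupled to it, beads are mixed by random shuffling and divided evenly among the containers, and every coupling goes to completion. The function name and the bead count of 900 are hypothetical.

```python
import random


def split_and_pool(beads, monomers, rng):
    """Run one divide-couple-remix round and return the remixed beads.

    beads is a list of strings (one per bead) recording the monomers
    coupled so far. The beads are shuffled, divided evenly among one
    container per monomer, coupled, and pooled again.
    """
    rng.shuffle(beads)               # beads are randomly dispersed
    n = len(monomers)
    pooled = []
    for i, monomer in enumerate(monomers):
        container = beads[i::n]      # every n-th bead goes to container i
        pooled.extend(bead + monomer for bead in container)
    return pooled


rng = random.Random()
beads = [""] * 900                   # a large number of bare beads
beads = split_and_pool(beads, ["A", "B", "C"], rng)
beads = split_and_pool(beads, ["D", "E", "F"], rng)
products = sorted(set(beads))
print(products)
```

Because each bead passes through exactly one container per round, every bead carries a single product, and with a large number of beads all nine combinations of a first portion and a second portion are represented.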

The beads are then recombined into one container and
additional steps are conducted to complete the
synthesis of the polymer molecules. In a preferred embodiment,
the beads are tagged with an identifying tag which is
unique to the particular double-stranded oligonucleotide or
probe which is present on each bead. A complete description
of identifier tags for use in synthetic libraries is provided in
co-pending application Ser. No. 08/146,886 (filed Nov. 2,
1993), previously incorporated by reference for all purposes.
Methods of Library Screening 

A library prepared according to any of the methods
described above can be used to screen for receptors having
high affinity for either unimolecular, double-stranded
oligonucleotides or conformationally restricted probes. In one
group of embodiments, a solution containing a marked
(labelled) receptor is introduced to the library and incubated
for a suitable period of time. The library is then washed free
of unbound receptor and the probes or double-stranded
oligonucleotides having high affinity for the receptor are
identified by identifying those regions on the surface of the
library where markers are located. Suitable markers include,
but are not limited to, radiolabels, chromophores,
fluorophores, chemiluminescent moieties, and transition metals.
Alternatively, the presence of receptors may be detected
using a variety of other techniques, such as an assay with a
labelled enzyme, antibody, and the like. Other techniques
using various marker systems for detecting bound receptor
will be readily apparent to those skilled in the art.

In a preferred embodiment, a library prepared on a single
solid support (using, for example, the VLSIPS™ technique)
can be exposed to a solution containing marked receptor
such as a marked antibody. The receptor can be marked in
any of a variety of ways, but in one embodiment marking is
effected with a radioactive label. The marked antibody binds
with high affinity to an immobilized antigen previously
localized on the surface. After washing the surface free of
unbound receptor, the surface is placed proximate to x-ray
film or phosphorimagers to identify the antigens that are
recognized by the antibody. Alternatively, a fluorescent
marker may be provided and detection may be by way of a
charge-coupled device (CCD), fluorescence microscopy or
laser scanning.

When autoradiography is the detection method used, the
marker is a radioactive label, such as ³²P. The marker on the
surface is exposed to X-ray film or a phosphorimager, which
is developed and read out on a scanner. An exposure time of
about 1 hour is typical in one embodiment. Fluorescence
detection using a fluorophore label, such as fluorescein,
attached to the receptor will usually require shorter exposure
times.

Quantitative assays for receptor concentrations can also
be performed according to the present invention. In a direct
assay method, the surface containing localized probes, prepared
as described above, is incubated with a solution
containing a marked receptor for a suitable period of time.
The surface is then washed free of unbound receptor. The
amount of marker present at predefined regions of the
surface is then measured and can be related to the amount of
receptor in solution. Methods and conditions for performing
such assays are well-known and are presented in, for
example, L. Hood et al., Immunology, Benjamin/Cummings
(1978), and E. Harlow et al., Antibodies: A Laboratory
Manual, Cold Spring Harbor Laboratory (1988). See also
U.S. Pat. No. 4,376,110 for methods of performing sandwich
assays. The precise conditions for performing these steps
will be apparent to one skilled in the art.

A competitive assay method for two receptors can also be
employed using the present invention. Methods of conducting
competitive assays are known to those of skill in the art.
One such method involves immobilizing conformationally
restricted probes on predefined regions of a surface as
described above. An unmarked first receptor is then bound
to the probes on the surface having a known specific binding
affinity for the receptors. A solution containing a marked
second receptor is then introduced to the surface and incubated
for a suitable time. The surface is then washed free of
unbound reagents and the amount of marker remaining on
the surface is measured. In another form of competition
assay, marked and unmarked receptors can be exposed to the
surface simultaneously. The amount of marker remaining on
predefined regions of the surface can be related to the
amount of unknown receptor in solution. Yet another form of
competition assay will utilize two receptors having different
labels, for example, two different chromophores.

In other embodiments, in order to detect receptor binding,
the double-stranded oligonucleotides which are formed with
attached probes or with a flexible linking group will be
treated with an intercalating dye, preferably a fluorescent
dye. The library can be scanned to establish a background
fluorescence. After exposure of the library to a receptor
solution, the exposed library will be scanned or illuminated
and examined for those areas in which fluorescence has
changed. Alternatively, the receptor of interest can be
labeled with a fluorescent dye by methods known to those of
skill in the art and incubated with the library of probes. The
library can then be scanned or illuminated, as above, and
examined for areas of fluorescence.
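
The comparison of a background scan with a post-exposure scan can be sketched as follows. This is a hedged illustration only: the per-region intensity lists, the fixed absolute threshold, and the function name are assumptions, and a practical scanner would also have to account for noise and photobleaching.

```python
def changed_regions(background, exposed, threshold):
    """Return indices of array regions whose fluorescence changed by more
    than `threshold` (same units as the scans) between the two scans."""
    return [
        i
        for i, (before, after) in enumerate(zip(background, exposed))
        if abs(after - before) > threshold
    ]


# Hypothetical per-region intensities before and after receptor exposure
background = [100.0, 102.0, 98.0, 101.0]
exposed = [99.0, 60.0, 97.0, 140.0]

hits = changed_regions(background, exposed, threshold=10.0)
```

Because the absolute difference is used, both a loss of fluorescence and a gain are reported as a change.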

In instances where the libraries are synthesized on beads
in a number of containers, the beads are exposed to a
receptor of interest. In a preferred embodiment the receptor
is fluorescently or radioactively labelled. Thereafter, one or
more beads are identified that exhibit significant levels of,
for example, fluorescence using one of a variety of
techniques. For example, in one embodiment, mechanical
separation under a microscope is utilized. The identity of the
molecule on the surface of such separated beads is then
determined using, for example, NMR, mass spectrometry,
PCR amplification and sequencing of the associated DNA,
or the like. In another embodiment, automated sorting (i.e.,
fluorescence activated cell sorting) can be used to separate
beads (bearing probes) which bind to receptors from those
which do not bind. Typically the beads will be labeled and
identified by methods disclosed in Needels, et al., Proc.
Natl. Acad. Sci. USA 90:10700-10704 (1993), incorporated
herein by reference.

The assay methods described above for the libraries of the
present invention will have tremendous application in such
endeavors as DNA "footprinting" of proteins which bind
DNA. Currently, DNA footprinting is conducted using
DNase I digestion of double-stranded DNA in the presence
of a putative DNA binding protein. Gel analysis of cut and
protected DNA fragments then provides a "footprint" of
where the protein contacts the DNA. This method is both
labor and time intensive. See, Galas et al., Nucleic Acid Res.
5:3157 (1978). Using the above methods, a "footprint" could
be produced using a single array of unimolecular,
double-stranded oligonucleotides in a fraction of the time of
conventional methods. Typically, the protein will be labeled
with a radioactive or fluorescent species and incubated with
a library of unimolecular, double-stranded DNA.
Phosphorimaging or fluorescence detection will provide a footprint
of those regions on the library where the protein has bound.
Alternatively, unlabeled protein can be used. When
unlabeled protein is used, the double-stranded oligonucleotides
in the library will all be labeled with a marker, typically a
fluorescent marker. Incorporation of a marker into each
member of the library can be carried out by terminating the
oligonucleotide synthesis with a commercially available
fluorescing phosphoramidite nucleotide derivative.
Following incubation with the unlabeled protein, the library will be
treated with DNase I and examined for areas which are
protected from cleavage.

The assay methods described above for the libraries of the 
present invention can also be used in reverse drug discovery. 
In such an application, a compound having known pharma- 
cological safety or other desired properties (e.g., aspirin) 
could be screened against a variety of double-stranded 
oligonucleotides for potential binding. If the compound is 
shown to bind to a sequence associated with, for example,
tumor suppression, the compound can be further examined 
for efficacy in the related diseases. 

In other embodiments, probe arrays comprising β-turn
mimetics can be prepared and assayed for activity against a
particular receptor. β-turn mimetics are compounds having
molecular structures similar to β-turns, which are one of the
three major components in protein molecular architecture.
β-turns are similar in concept to hairpin turns of
oligonucleotide strands, and are often critical recognition features for
various protein-ligand and protein-protein interactions. As a
result, a library of β-turn mimetic probes can provide or
suggest new therapeutic agents having a particular affinity
for a receptor which will correspond to the affinity exhibited
by the β-turn and its receptor.
Bioelectronic Devices and Methods

In another aspect, the present invention provides a method
for the bioelectronic detection of sequence-specific
oligonucleotide hybridization. A general method and device
which is useful in diagnostics in which a biochemical
species is attached to the surface of a sensor is described in
U.S. Pat. No. 4,562,157 (the Lowe patent), incorporated
herein by reference. The present method utilizes arrays of
immobilized oligonucleotides (prepared, for example, using
VLSIPS™ technology) and the known photo-induced
electron transfer which is mediated by a DNA double helix
structure. See, Murphy et al., Science 262:1025-1029
(1993). This method is useful in hybridization-based
diagnostics, as a replacement for fluorescence-based detection
systems. The method of bioelectronic detection also offers
higher resolution and potentially higher sensitivity than
earlier diagnostic methods involving sequencing/detecting
by hybridization. As a result, this method finds applications
in genetic mutation screening and primary sequencing of
oligonucleotides. The method can also be used for
Sequencing By Hybridization (SBH), which is described in
co-pending application Ser. Nos. 08/082,937 (filed Jun. 25,
1993, now abandoned) and 08/168,904 (filed Dec. 15, 1993),
each of which is incorporated herein by reference for all
purposes. This method uses a set of short oligonucleotide
probes of defined sequence to search for complementary
sequences on a longer target strand of DNA. The
hybridization pattern is used to reconstruct the target DNA
sequence. Thus, the hybridization analysis of large numbers
of probes can be used to sequence long stretches of DNA. In
immediate applications of this hybridization methodology, a
small number of probes can be used to interrogate local
DNA sequence.
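
The idea of reconstructing a target sequence from its hybridization pattern can be illustrated with a minimal sketch. It assumes an idealized, error-free experiment in which the set of hybridizing probes is exactly the set of length-k subsequences of the target, the first probe is known, and every (k-1)-base overlap is unambiguous. The function names and the example target are hypothetical and are not taken from the referenced applications.

```python
def spectrum(target, k):
    """All length-k substrings of the target: the ideal set of probes
    that would hybridize to it."""
    return {target[i:i + k] for i in range(len(target) - k + 1)}


def reconstruct(probes, start, k):
    """Chain probes by (k-1)-base overlaps, beginning from `start`.

    Assumes an error-free spectrum with unambiguous overlaps; repeated
    subsequences or missing probes make this simple chaining fail.
    """
    remaining = set(probes)
    remaining.discard(start)
    sequence = start
    while remaining:
        suffix = sequence[-(k - 1):]
        candidates = [p for p in remaining if p.startswith(suffix)]
        if len(candidates) != 1:
            raise ValueError("ambiguous or broken overlap; cannot reconstruct")
        sequence += candidates[0][-1]
        remaining.remove(candidates[0])
    return sequence


target = "ATGCGTACCTGA"  # hypothetical target sequence
k = 4
probes = spectrum(target, k)
rebuilt = reconstruct(probes, start=target[:k], k=k)
```

The sketch raises an error rather than guessing when an overlap is ambiguous, which is the situation in which longer probes or additional information would be needed.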

In the present inventive method, hybridization is
monitored using bioelectronic detection. In this method, the target
DNA, or first oligonucleotide, is provided with an
electron-donor tag and then incubated with an array of
oligonucleotide probes, each of which bears an electron-acceptor tag
and occupies a known position on the surface of the array.
After hybridization of the first oligonucleotide to the array
has occurred, the hybridized array is illuminated to induce
an electron transfer reaction in the direction of the surface of
the array. The electron transfer reaction is then detected at
the location on the surface where hybridization has taken
place. Typically, each of the oligonucleotide probes in an
array will have an attached electron-acceptor tag located
near the surface of the solid support used in preparation of
the array. In embodiments in which the arrays are prepared
by light-directed methods (i.e., typically 3' to 5' direction),
the electron-acceptor tag will be located near the 3' position.
The electron-acceptor tag can be attached either to the 3'
monomer by methods known to those of skill in the art, or
it can be attached to a spacing group between the 3'
monomer and the solid support. Such a spacing group will
have, in addition to functional groups for attachment to the
solid support and the oligonucleotide, a third functional
group for attachment of the electron-acceptor tag. The target
oligonucleotide will typically have the electron-donor tag
attached at the 3' position. Alternatively, the target
oligonucleotide can be incubated with the array in the absence of
an electron-donor tag. Following incubation, the
electron-donor tag can be added in solution. The electron-donor tag
will then intercalate into those regions where hybridization
has occurred. An electron transfer reaction can then be
detected in those regions having a continuous DNA double
helix.

The electron-donor tag can be any of a variety of
complexes which participate in electron transfer reactions and
which can be attached to an oligonucleotide by a means
which does not interfere with the electron transfer reaction.
In preferred embodiments, the electron-donor tag is a
ruthenium (II) complex, more preferably a ruthenium (II)
(phen')2(dppz) complex.

The electron-acceptor tag can be any species which, with
the electron-donor tag, will participate in an electron transfer
reaction. An example of an electron-acceptor tag is a
rhodium (III) complex. A preferred electron-acceptor tag is
a rhodium (III) (phi)2(phen') complex.

In a particularly preferred embodiment, the
electron-donor tag is a ruthenium (II) (phen')2(dppz) complex and the
electron-acceptor tag is a rhodium (III) (phi)2(phen')
complex.

In still another aspect, the present invention provides a
device for the bioelectronic detection of sequence-specific
oligonucleotide hybridization. The device will typically
consist of a sensor having a surface to which an array of
oligonucleotides is attached. The oligonucleotides will be
attached in pre-defined areas on the surface of the sensor and
have an electron-acceptor tag attached to each
oligonucleotide. The electron-acceptor tag will be a tag which is
capable of producing an electron transfer signal upon
illumination of a hybridized species, when the complementary
oligonucleotide bears an electron-donating tag. The signal
will be in the direction of the sensor surface and be detected
by the sensor.

In a preferred embodiment, the sensor surface will be a
silicon-based surface which can sense the electronic signal
induced and, if necessary, amplify the signal. The metal
contacts on which the probes will be synthesized can be
treated with an oxygen plasma prior to synthesis of the
probes to enhance the silane adhesion and concentration on
the surface. The surface will further comprise a multi-gated
field effect transistor, with each gate serving as a sensor and
different oligonucleotides attached to each gate. The
oligonucleotides will typically be attached to the metal contacts
on the sensor surface by means of a spacer group.

The spacer group should not be too long, in order to
ensure that the sensing function of the device is easily
activated by the binding interaction and subsequent
illumination of the "tagged" hybridized oligonucleotides.
Preferably, the spacer group is from 3 to 12 atoms in length and
will be as described above for the surface modifying portion
of the spacer group, L1.

The oligonucleotides which are attached to the spacer
group can be formed by any of the solid phase techniques
which are known to those of skill in the art. Preferably, the
oligonucleotides are formed one base at a time in the
direction of the 3' terminus to the 5' terminus by the
"light-directed" methods described above. The
oligonucleotide can then be modified at the 3' end to attach the
electron-acceptor tag. A number of suitable methods of
attachment are known. For example, modification with the
reagent Aminolink2 (from Applied Biosystems, Inc.)
provides a terminal phosphate moiety which is derivatized with
an aminohexyl phosphate ester. Coupling of a carboxylic
acid, which is present on the electron-acceptor tag, to the
amine can then be carried out using HOBT and DCC.
Alternatively, synthesis of the oligonucleotide can begin
with a suitably derivatized and protected monomer which
can then be deprotected and coupled to the electron-acceptor
tag once the complete oligonucleotide has been synthesized.

The silica surface can also be replaced by silicon nitride
or oxynitride, or by an oxide of another metal, especially
aluminum, titanium (IV) or iron (III). The surface can also
be any other film, membrane, insulator or semiconductor
overlying the sensor which will not interfere with the
detection of electron transfer and to which an
oligonucleotide can be coupled.

Additionally, detection devices other than an FET can be
used. For example, sensors such as bipolar transistors, MOS
transistors and the like are also useful for the detection of
electron transfer signals.
Adhesives

In still another aspect, the present invention provides an
adhesive comprising a pair of surfaces, each having a
plurality of attached oligonucleotides, wherein the
single-stranded oligonucleotides on one surface are complementary
to the single-stranded oligonucleotides on the other surface.
The strength and position/orientation specificity can be
controlled using a number of factors including the number
and length of oligonucleotides on each surface, the degree of
complementarity, and the spatial arrangement of
complementary oligonucleotides on the surface. For example,
increasing the number and length of the oligonucleotides on each
surface will provide a stronger adhesive. Suitable lengths of
oligonucleotides are typically from about 10 to about 70
nucleotides. Additionally, the surfaces of oligonucleotides
can be prepared such that adhesion occurs in an extremely
position-specific manner by a suitable arrangement of
complementary oligonucleotides in a specific pattern. Small
deviations from the optimum spatial arrangement are
energetically unfavorable as many hybridization bonds must be
broken and are not reformed in any other relative
orientation.
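
The position specificity described above can be illustrated with a simple counting sketch. It models each surface as a one-dimensional row of sites, each carrying one oligonucleotide, and counts how many sites face an exact complement when one surface is displaced relative to the other. The row model, the exact-match criterion, and the example 10-mers are all assumptions made for illustration.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def matched_sites(surface_a, surface_b, shift):
    """Count sites where the oligo on surface A faces its exact
    complement on surface B when B is displaced by `shift` sites."""
    count = 0
    for i, oligo in enumerate(surface_a):
        j = i + shift
        if 0 <= j < len(surface_b) and surface_b[j] == reverse_complement(oligo):
            count += 1
    return count


# Hypothetical 10-mers patterned along one axis of surface A
surface_a = ["ACGTTGCAAG", "GGATCCTTAC", "TTGACCGATA", "CAGTCAGGTT"]
# Surface B carries the complement of each oligo at the same site
surface_b = [reverse_complement(oligo) for oligo in surface_a]

scores = {shift: matched_sites(surface_a, surface_b, shift) for shift in (-1, 0, 1)}
```

In this toy model every site pairs at zero displacement and none pairs when the surfaces are offset by one site, which mirrors the statement that small deviations from the optimum arrangement break many hybridization bonds without forming new ones.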

The adhesives of the present invention will find use in
numerous applications. Generally, the adhesives are useful
for adhering two surfaces to one another. More specifically,
the adhesives will find application where biological
compatibility of the adhesive is desired. An example of a
biological application involves use in surgical procedures
where tissues must be held in fixed positions during or
following the procedure. In this application, the surfaces of
the adhesive will typically be membranes which are
compatible with the tissues to which they are attached.

A particular advantage of the adhesives of the present
invention is that when they are formed in an orientation
specific manner, the adhesive portions will be "self-finding,"
that is, the system will go to the thermodynamic equilibrium
in which the two sides are matched in the predetermined,
orientation specific manner.

EXAMPLES

Example 1 

This example illustrates the general synthesis of an array
of unimolecular, double-stranded oligonucleotides on a solid
support.

Unimolecular double-stranded DNA molecules were
synthesized on a solid support using standard light-directed
methods (VLSIPS™ protocols). Two hexaethylene glycol
(PEG) linkers were used to covalently attach the synthesized
oligonucleotides to the derivatized glass surface. Synthesis
of the first (inner) strand proceeded one nucleotide at a time
using repeated cycles of photo-deprotection and chemical
coupling of protected nucleotides. The nucleotides each had
a protecting group on the base portion of the monomer as
well as a photolabile MeNPoc protecting group on the 5'
hydroxyl. Upon completion of the inner strand, another
MeNPoc-protected PEG linker was covalently attached to
the 5' end of the surface-bound oligonucleotide. After
addition of the internal PEG linker, the PEG was photodeprotected,
and the synthesis of the second strand proceeded in the
normal fashion. Following the synthesis cycles, the DNA
bases were deprotected using standard protocols. The
sequence of the second (outer) strand, being complementary
to that of the inner strand, provided molecules with short,
hydrogen bonded, unimolecular double-stranded structure
as a result of the presence of the internal flexible PEG linker.

An array of 16 different molecules was synthesized on a
derivatized glass slide in order to determine whether short,
unimolecular DNA structures could be formed on a surface
and whether they could adopt structures that are recognized
by proteins. Each of the 16 different molecular species
occupies a different physical region on the glass surface so
that there is a one-to-one correspondence between molecular
identity and physical location. The molecules are of the form

S-P-P-C-C-A/T-A/T-A/T-A/T-G-C-P-G-C-A/T-A/T-A/T-A/T-G-G-F

where S is the solid surface having silyl groups, P is a PEG
linker, A, C, G, and T are the DNA nucleotides, and F is a
fluorescent tag. The DNA sequence is listed from the 3' to
the 5' end (the 3' end of the DNA molecule is attached to the
solid surface via a silyl group and 2 PEG linkers). The
sixteen molecules synthesized on the solid support differed
in the various permutations of A and T in the above formula.

Example 2 

This example illustrates the ability of a library of
surface-bound, unimolecular, double-stranded oligonucleotides to
exist in duplex form and to be recognized and bound by a
protein.

A library of 16 different members was prepared as
described in Example 1. The 16 molecules all have the same
composition (same number of As, Cs, Gs and Ts), but the
order is different. Four of the molecules have an outer strand
that is 100% complementary to the inner strand (these
molecules will be referred to as DS, double-stranded, below).
One of the four DS oligonucleotides has a sequence that is
recognized by the restriction enzyme EcoRI. If the molecule
can loop back and form a DNA duplex, it should be
recognized and cut by the restriction enzyme, thereby
releasing the fluorescent tag. Thus, the action of the enzyme
provided a functional test for DNA structure, and also served
to demonstrate that these structures can be recognized at the
surface by proteins. The remaining 12 molecules had outer
strands that were not complementary to their inner strands
(referred to as SS, single-stranded, below). Of these, three
had an outer strand and three had an inner strand whose
sequence was an EcoRI half-site (the sequence on one
strand was correct for the enzyme, but the other half was
not). The solid support with an array of molecules on the
surface is referred to as a "chip" for the purposes of the
following discussion. The presence of fluorescently labelled
molecules on the chip was detected using confocal
fluorescence microscopy. The action of various enzymes was
determined by monitoring the change in the amount of
fluorescence from the molecules on the chip surface (e.g.,
"reading" the chip) upon treatment with enzymes that can
cut the DNA and release the fluorescent tag at the 5' end.
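
The composition of such a 16-member library can be sketched in code. The four A/T cores used below are purely hypothetical: the text does not list the sequences actually synthesized, so these were chosen only because they have identical composition and reproduce the stated counts (4 DS members, 12 SS members, and three half-site members of each kind). Strands are written 3' to 5', following the formula given in Example 1.

```python
from itertools import product

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
ECORI_SITE = "GAATTC"  # recognition sequence, written 5'->3'

# ASSUMPTION: illustrative cores only, not the sequences used in the example.
CORES = ["AATT", "TTAA", "ATAT", "TATA"]


def complement(seq):
    """Base-by-base complement, without reversing."""
    return "".join(COMPLEMENT[base] for base in seq)


def inner_strand(core):
    """Inner strand written 3'->5' (C-C-core-G-C)."""
    return "CC" + core + "GC"


def outer_strand(core):
    """Outer strand written 3'->5' (G-C-core-G-G)."""
    return "GC" + core + "GG"


def is_duplex(inner, outer):
    """True if the outer strand, folded back antiparallel, pairs with
    every base of the inner strand."""
    return outer[::-1] == complement(inner)


def has_ecori_site(strand_3to5):
    """Look for GAATTC after rewriting the strand 5'->3'."""
    return ECORI_SITE in strand_3to5[::-1]


library = [(inner_strand(a), outer_strand(b)) for a, b in product(CORES, CORES)]
ds = [m for m in library if is_duplex(*m)]
ss = [m for m in library if not is_duplex(*m)]
ds_ecori = [m for m in ds if has_ecori_site(m[0]) and has_ecori_site(m[1])]
ss_outer_half_site = [m for m in ss if has_ecori_site(m[1]) and not has_ecori_site(m[0])]
ss_inner_half_site = [m for m in ss if has_ecori_site(m[0]) and not has_ecori_site(m[1])]
```

With these assumed cores, the DS member built from the TTAA core reads GAATTC on both strands when written 5' to 3', which corresponds to the single DS region carrying a complete EcoRI site.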

The three different enzymes used to characterize the
structure of the molecules on the chip were:

1) Mung Bean Nuclease: sequence independent,
single-strand specific DNA endonuclease;

2) DNase I: sequence independent, double-strand
specific endonuclease;

3) EcoRI: restriction endonuclease that recognizes the
sequence (5'-3') GAATTC in double-stranded DNA, and cuts
between the G and the first A.

Mung Bean Nuclease and EcoRI were
obtained from New England Biolabs, and DNase I was
obtained from Boehringer Mannheim. All enzymes were
used at a concentration of 200 units per mL in the buffer
recommended by the manufacturer. The enzymatic reactions
were performed in a 1 mL flow cell at 22° C. and were
typically allowed to proceed for 90 minutes.

Upon treatment of the chip with the enzyme EcoRI, the
fluorescence signal in the DS EcoRI region and the 3 SS
regions with the EcoRI half-site on the outer strand was
reduced by about 10% of its initial value. This reduction was
at least 5 times greater than for the other regions of the chip,
indicating that the action of the enzyme is sequence specific
on the chip. It was not possible to determine if the factor is
greater than 5 in these preliminary experiments because of
uncertainty in the constancy of the fluorescence background.
However, because the purpose of these early experiments
was to determine whether unimolecular double-stranded
structures could be formed and whether they could be
specifically recognized by proteins (and not to provide a
quantitative measure of enzyme specificity), qualitative
differences between the different synthesis regions were
sufficient.

The reduction in signal in the 3 SS regions with the EcoRI
half-site on the outer strand indicated either that the enzyme
cuts single-stranded DNA with a particular sequence, or that
these molecules formed a double-stranded structure that was
recognized by the enzyme. The molecules on the chip
surface were at a relatively high density, with an average
spacing of approximately 100 angstroms. Thus, it was
possible for the outer strand of one molecule to form a
double-stranded structure with the outer strand of a
neighboring molecule. In the case of the 3 SS regions with the
EcoRI half-site on the outer strand, such a bimolecular
double-stranded region would have the correct sequence and
structure to be recognized by EcoRI. However, it would
differ from the unimolecular double-stranded molecules in
that the inner strand remains single-stranded and thus
amenable to cleavage by a single-strand specific endonuclease
such as Mung Bean Nuclease. Therefore, it was possible to
distinguish unimolecular from bimolecular double-stranded
DNA molecules on the surface by their ability to be cut by
single and double-strand specific endonucleases.
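
The reasoning above amounts to a small decision rule, which can be written out as a hedged sketch. The function name, the Boolean inputs, and the returned labels are hypothetical; in practice each input would be derived from the measured change in fluorescence of a region, with thresholds that are not specified here.

```python
def classify_region(cut_by_single_strand_nuclease, cut_by_double_strand_enzyme):
    """Classify the structure in a chip region from its enzyme susceptibility.

    A region cut by both kinds of enzyme is consistent with a bimolecular
    duplex whose inner strand remains single-stranded; a region cut only
    by the double-strand specific enzyme is consistent with a
    unimolecular duplex.
    """
    if cut_by_single_strand_nuclease and cut_by_double_strand_enzyme:
        return "bimolecular duplex"
    if cut_by_double_strand_enzyme:
        return "unimolecular duplex"
    if cut_by_single_strand_nuclease:
        return "single-stranded"
    return "undetermined"


# Example: resistant to Mung Bean Nuclease but cut by EcoRI or DNase I
label = classify_region(False, True)
```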

In order to remove all molecules that have single-stranded
structures and to identify unimolecular double-stranded
molecules, the chip was first exhaustively treated with Mung
Bean Nuclease. The reduction in the fluorescence signal was
greater by about a factor of 2 for the SS regions of the chip,
including those with the EcoRI half-site on the outer strand
that were cleaved by EcoRI, than for the 4 DS regions.
Following Mung Bean Nuclease treatment, the chip was
treated with either DNase I (which cuts all remaining
double-stranded molecules) or EcoRI (which should cut
only the remaining double-stranded molecules with the
correct sequence). Upon treatment with DNase I, the
fluorescence signal in the 4 DS regions was reduced by at least
5-fold more than the signal in the SS regions. Upon EcoRI
treatment, the signal in the single DS region with the correct
EcoRI sequence was reduced by at least a factor of 3 more
than the signal in any other region on the chip. Taken
together, these results indicated that the surface-bound
molecules synthesized with two complementary strands
separated by a flexible PEG linker form intramolecular
double-stranded structures that were resistant to a single-strand
specific endonuclease and were recognized by both a
double-strand specific endonuclease, and a
sequence-specific restriction enzyme.
What is claimed is: 

1. A synthetic unimolecular, double-stranded
oligonucleotide library comprising a plurality of different members,
each member having the formula:

Y-L1-X1-L2-X2

wherein,
Y is a solid support;

X1 and X2 are a pair of complementary oligonucleotides;
L1 is a spacer;

L2 is a linking group having sufficient length such that X1
and X2 form a double-stranded oligonucleotide.

2. A library in accordance with claim 1, wherein L2 is a
polyethylene glycol group.

3. A library in accordance with claim 1, wherein X1 and
X2 are complementary oligonucleotides each comprising of
from 6 to 30 nucleic acid monomers.

4. A library in accordance with claim 1, wherein said solid
support is a silica support and L1 comprises an
aminoalkylsilane and from 1 to 4 hexaethyleneglycols.

5. A library in accordance with claim 1, wherein said solid
support is a silica support, L1 comprises an aminoalkylsilane
and from 1 to 4 hexaethyleneglycols, L2 is a
polyethyleneglycol group and X1 and X2 are complementary
oligonucleotides each comprising of from 6 to 30 nucleic acid
monomers.

6. A synthetic unimolecular, double-stranded
oligonucleotide library of claim 1, wherein a portion of said
double-stranded oligonucleotides formed by X1 and X2 further
comprise a loop.



